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We discuss reasons why a probability amplitude, which becomes a 
^ I probability density after squaring, is considered as one of the most ba- 

C^ ■ sic ingredients of quantum mechanics. First, the Heisenberg/Schrodinger 

^ , ' equation, an equation of motion in quantum mechanics, describes a time 

evolution of the probability amplitude rather than of a probability density. 
There may be reasons why dynamics of a physical system are described by 
rvQ ' amplitude. In order to investigate one role of the probability amplitude 

^ , in quantum mechanics, specialised code-transfer experiments are designed 

^^ ' using classical information theory. Within this context, quantum mechan- 

Cn I ics based on probability amplitude provides the following: i) a minimum 

00 . error of the code transfer; ii) this error is independent of coding param- 

eters; and iii) nontrivial and nonlocal correlation can be realised. These 
^^ , are considered essential advantages of the probability amplitude over the 

c7^ ' probability density. 

m 

1 Introduction 

p\ . Quantum mechanics (QM) is considered the most basic theory of nature. All 



phenomena including those of the gravitational force are considered to be ex- 
pressed by a language of QM. However, an essential understanding of the basic 
nature of QM yet to be realised, and efforts to look for more fundamental expla- 
nations continue. Of course, QM itself is a self-consistent theory and requires 
no fundamental reasoning to support its truths beyond what are gains from ex- 
periments. Still, it is worth pursuing more basic reasons which determine QM 
to be the most fundamental law of nature. For instance, Wheeler asked "Why 
the quantum?" and discussed the relation between QM and information theory 
[131 [2]. In this report we attempt to answer the same question from Wheeler's 
point of view. 
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One of the most essential differences between quantum and classical mechan- 
ics is the former's need for a probabilistic treatment of theoretical predictions. 
One cannot avoid the probabilistic interpretation of a wave function proposed by 
Born [2] , which is now known as the Copenhagen interpretation. A fundamental 
equation of QM, the Heisenberg/Schrodinger equation, does not describe the 
behavior of a physical observable nor its probability density; rather, it describes 
the probability amplitude, which is a characteristic of QM and possesses no 
classical counterpart. This report considers reasons why fundamental laws of 
physics are described by probability amplitude instead of probability density, 
leaving aside the question of why probability itself is necessary. To clarify essen- 
tial properties of probability amplitude, code-transfer experiments are designed 
on the basis of classical information theory. Taking into account the discussions 
on these experiments, three essential advantages of probability amplitude over 
probability density are pointed out in the following sections. 

First, a definition of probability density is given in Section 2 under a very 
general mathematical framework. Then, code-transfer experiments are designed 
within classical information theory to investigate the role of amplitude. Exper- 
iments using a stochastic algorithm cannot avoid statistical error due to sample 
number. In Section 3, we show that a coding method based on probability 
amplitude should minimise statistical error. Moreover, statistical errors of the 
code-transfer are independent of the parametrisation allowing each character to 
be transferred; this is shown in Section 4. Another essential feature of QM is 
its lack of local realism, which can be judged by Bell's inequality. This local 
realism and Bell's inequality are described using terminology of classical in- 
formation theory, again as the code-transfer experiment. A method based on 
the probability amplitude can induce a violation of Bell's inequality, as shown 
in Section 5. Throughout this report, classical information theory is used to 
describe code-transfer experiments. 

2 General quantum system 

A general framework to define the probability amplitude appearing in QM is 
considered in this section. Here we emphasise algebraic aspects of QM and 
ignore dynamical ones. The question which must be asked here is "What mini- 
mum set of assumptions makes a system look like a quantum system?" Besides 
a space-time manifold as a stage of all physical phenomena, we propose the fol- 
lowing elements as indispensable ingredients for QM of a single particle system. 

Definition 2.1. (Amplitude and probability measure) 

K is assumed to be some field (including a noncommutative one) and V is a 
linear (vector) space on it. K is named as a base field and is associated to each 
point of the space-time manifold. Probability amplitude and probability measure 
are introduced on these spaces as follows. 

1. A map from a point on a base set Ad to a vector space V , 

il^VL'. M ^V : X ^ V'K(a;), 

is named an amplitude. Here, Ai is the base set and x is a point on it. In 
order to define a measure on the vector space, probability amplitudes are 
assumed to form a a-algebra of the base field K. 



2. A map from the amplitude to a real number 

^■.V ^R:^K^ m(V'k) G [0, 1] 

is named a probability (measure) . The sequential map 

fj. o ip^ : M ^ {V -^)R : x ^ fi{x) = /x(V'k)(x) 

is also called a probability and represented by the same symbol, lJ-{x), when 
K and V are obvious. The probability measure must satisfy 



Y^ fi{ipK){x) = 1. 



3. The set {K, V, ?/;K,/i} is named a "quantum space." 

For standard relativistic (nonrelativistic) QM, we take Hilbert space as a 
vector-space 1^ on a field of complex numbers C with probability measure of 
IJ,{4') = 1^1^ in the Minkowski (Galilean) manifold as a base set. (Sometimes a 
Fourier transformation of ^ defined in the momentum manifold is used instead 
of ■0 itself.) In standard QM, probability amplitudes can be constructed using 
square integrable functions on a given support. Besides this standard quan- 
tum space, there are several approaches to constructing quantum mechanics on 
spaces other than the Hilbert space on complex numbers. After a pioneering 
work of Nambu |10| on quantum mechanics on the Galois field, QM on finite 
fields are discussed [21 HH \5\. In this report, simple quantum spaces are used 
since only algebraic aspects of QM are of interest here. 

3 Minimisation of measurement error 

First, let us consider a statistical error for measurements of a single physical ob- 
servable on the quantum space defined in the previous section. A code-transfer 
experiment simulating standard QM in a much simpler quantum space, retain- 
ing essential properties, is introduced here. In information theory, an encoding 
method which minimises statistical error among methods using stochastic algo- 
rithms is known. The method using probability amplitude is shown to be an 
example of such an encoding method giving minimum errors. Terminology of 
classical information theory used here can be found in Appendix A.l and [9]. 

Definition 3.1. (Stochastic code-transfer experiment) 

The experiment satisfying the following conditions is called a stochastic code- 
transfer experiment: 

1. Alice (A) transfers a set of m different letters V ~ {ipi, ■ ■ • ,'>Pm} to Bob 
after encoding them. 

2. Bob (B) receives codes sent from A and decodes them to a set of letters. 

3. Encoding is performed using the following probabilistic function: 

/I : y ^ K : m(V^,) ^ P. e [0,1]. 




Figure 1: Example for rn = 3: assignment of a vector V on two-dimensional sphere. 



4- A can send a finite number (n times here) of the same code to B. 

5. B obtains n independent codes from. A, such as Sf^n = {^i,''' ,Xn}. 
Each datum is Xk = {xk.i,-'' i^k.m}, where Xkj G [0,1] for k — 
1 , • • • ,n and j = 1 , ■ • • ,m. 

6. B has an unbiased estimator to obtain a set of letters (V ) from measured 
data as 
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fe=l 

where {ipi, • • • , tpm} is a measured set of letters. The probability to obtain 
Xk,j — 1 {xk.j — 0) by B is pi (1 — Pi) as encoded by A; then each ipi 
may be the same as the original ipi when n — ?> oo. However, with a finite 
number of data, there is a possibility of choosing a wrong ipi on account 
of the statistical error of measuring Xi . 

This code-transfer experiment is constructed on the quantum space {M, V, tpi, /x} 
as defined above. In this case, a base set has only two points, i.e., where A or 
B exists. No dynamical structure is assumed to transport a code from A to B. 
It is assumed that these two points are separated from each other and there is 
no way to communicate other than the code transfer. A question to ask here is 
how may one find the method to encode the letter ipi to a real number fj.{ipi) to 
minimise an error of this experiment for any tpi . The answer is already known as 
a theorem, which was first obtained by Fisher [5] . Wootters stated this theorem 
|15) without any proof but later provided the same by introducing a statistical 
distance |16) . Here we state the theorem clearly again and give an independent 
proof. 

Theorem 3.2. (Fisher Wootters) 

Among stochastic code-transfer experiments, that which employs the following 



encoding rules gives the smallest error to measure a single letter from a set of 
letters: 

1. Select a set of letters V — {ipi, ■ ■ • , V'm}- 

2. Prepare an m- dimensional Euclid space M™ and orthonormal bases {771, • • • , rjm}- 

3. Map V at a point on a unit sphere S^~ centered at the origin of M™. 
4- Set the probability to obtain ipi to be n{ipi) = |yip, where 7]i is an i's 



component of the position of V on 5™ 
above. 



by the orthonormal bases defined 



Proof. 1. The smallest error => n{ipi) = Ij/ip: 

Data after n independent measurements are expressed as ^n = (^1 : 
which satisfies a fifth condition of Definition 13.11 The third condition 
follows from the third assumption of the theorem. When the probability 
density to obtain a set of data jr„ is expressed as f(jr„; V) — fJ.{ipi), the 
Fisher information matrix (FIM) [^ can be written as 



:^n), 



Jj' 






^\ogi{X^;V) dlogi{^n;V) 
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dipi dtpj 

dn{ipk)/dipi dn{ipk)/dipj 



Ki^k) 



M(V'fc) 



The functions fJ-{ipi) are not independent of each other owing to conser- 
vation of the total probability, J2™=i M(V'fe) = 1- We can assume that aU 
H^Tpk) {k > 2) are independent except /x('(/'i) = 1 — X]fcL2 /^(V"/:) without 
any loss of generality. Here we use a short-hand expression, /i('0i) — IJ-i, 
d^i{4'k)/d'4'i = Hk.i, and X]fcL2 M(V'fc) = M! then the diagonal components 
of FIM can be written as 



'Jii — 
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The minimum value of Ju is obtained when p, = fii within the allowed 
region oi fii < /i < 1 . Then we get 



^X^iij — ^ii — 



\^^^,^\ 



^iiil- fj,,)' 



On the other hand, measured data after n independent measurements 
must follow a multinomial distribution, whose covariance matrix cr is 

\npi{l-p,) (i^j), 



where pi is measured probability of an ith letter. Then, after n inde- 
pendent measurements through estimator T defined in Definition 2.6, a 
covariant matrix S(p) can be expressed as 

1 " 

^ij\P) ~ ~ / ^ [^k.i ~ XiJlXkJ — Xj) 



1 
— (T. 

n 



fe=i 

'ij, 



Then, diagonal components of the covariant matrix become 



1 

— Cii 

n 



In general, measured probability (pi) differs from true probability (ni); 
however, it is certain that the error of |pi — /ii| will be less than any 
small value after a sufficient number of events accumulates, as a result 
of the law of large numbers and the assumption that the estimator is 
unbiased. Then, we use fii instead of pi in the discussions that follow. 
The probability fXi that maximises diagonal components of the covariant 
matrix is given as fii = 1/2 due to dJ^u/dfii = (1 — 2/ii) = 0. Then, the 
diagonal components of the covariant matrix are given as S^ — 1/4. The 
Cramer-Rao inequality [11] [7j [6] gives the lower bound of the covariant 
matrix as 

A possible range of the inverse of the FIM is 



J^ > J^ > 0. 



|M«, 



Then a solution of the following differential equation gives the minimum 
variance in general: 



1 



yUj(l - ^J.i) 



dfii 



dipi 
dni 



d^pi 



4/Xi (1 - ^i) . 



The solution of this equation can be obtained as 

/^(V'i) = cos^ V'i and sin^ ipi. 

From these two solutions, we employ /i(V'i) = cos^ ipi. This selectiorl]J has 
nothing to do with the discussions that follow. Each ipi gives the same 
differential equation; then parametrisation y^ — \/n{ipi) = cos-^i gives 
the lowest value of the variance, which is nothing other than the direction 
cosine of the vector V, whose endpoint is on the unit sphere S^~^. Then, 



-"^In this selection, cos^i is called a directional cosine of a vector V. 
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Figure 2: Code-transfer experiment realised by using polarised laser beam. 



the method to give the minimuni variance is: i) normahse the letter V'i to 
< V'j ^ '""Z^; ii) map on the S^~^ as f/'i to be an angle from axis r/^; set 
iii) the probability to observe the letter ipi to be cos^V'ij which are the 
same as the assumptions of the theorem. 

2. fi{ipi) = \yi\'^ => the smallest error: 

When we set niipi) = l2/i|^ = cos^-^i: the diagonal components of a co- 
variant matrix become 

= COS^ ipi{l — COS^ Vi) 

= COS V'i sin ipi. 

Then the minimum value of S^ is obtained to be 1/4 at ^pi = 7r/4. On 
the other hand, the diagonal component of the FIM matrix can be 



^ii 



d^iil^i 



dipi 



1 



M(V'i)(l - fJ^iipi)) 
= 4 cos^ Vi sin^ ipi I (cos^ Vi sin^ Vi ) 
= 4. 

Then J~^ ~ 1/4, which matches the minimum value of E^ 



□ 



In the above decoding method, a relation between probability amplitude 
and density is algebraically the same as in the standard QM, which means the 
latter employs a coding method that minimises statistical error among other 
stochastic methods. This is our first example outlining the advantage of the 
method using probability amplitude. 



4 Parametrisation independence of a measure- 
ment error 

Related to the Theorem 13.21 one can prove following theorem, which is also 
given by Wootters |15II16) and is important to consider one role of the probability 
amplitude. 



Theorem 4.1. 

The encoding rule given by the Fisher-Wootters theorem gives uniform errors 
independent of its parametrisation. 

Proof. A set of letters V = {V'l, ■ • ■ , "0™} are encoded as yi = cos^i, J27Li Vi — 
1, according to the Fisher-Wootters theorem. Looking at an ith letter ipi, one 
sees a relation between an error of letter-estimation Sipi and an error to measure 
the parameter dyi as \Syi/Sipi\ = sinipi. Under the normalisation condition of 
Si!Li Vi — 12^1 cos^ ipi — I, the total error after measuring n independent data 
becomes 

a^ = 



1 "* 
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which means a mean-square error is determined by the statistics per degree of 
freedom and independent of the position on an Tn-dimensional sphere. A factor 
cr^ oc l/ri follows from the central limit theorem. D 

Example 4.2. (Code-transfer experiment realised using a polarised 
laser beam) 

Let us consider the code-transfer experiment defined in Definition 13. II for a re- 
alistic quantum system: the pulse laser has a polariser (\/A plate). (See Fig. \^) 
Alice (A) has experimental equipment consisting of a pulse laser and a polariser 
and can transfer a single photon with linear polarisation with any polarisation 
plane to Bob (B). B has a A/4 plate with fixed plane and photon detector with 
100% efficiency. A knows the angle of the polariser plane of B, say 6q, and has 
a clock exactly synchronised to that of B. A assigns codes on equally separated 
points on a unit circle, and selects an integer, say j. Then A sets an angle of 
the polariser according to a code to be 6 = Oq -\- a, where a = j /^ir. A trans- 
fers one photon a second and n photons in total. B measures photons behind 
the A/4 plate. If B observes a photon, he records " 1" , and if not, he records 
"0". As a result B obtains data S; = {Xi,X2,--- ,X„} = {1, 1, 0, 1, 0, • • •}, 
and decodes them to one real number with average x = '^^ Xi/n. According 
to quantum mechanics this number must be x — sin a. Finally, B obtains a 
number which A intended to send. This code transfer experiment satisfies Def- 
inition 13.11 which means quantum mechanics gives code-transfer experiments 
with the smallest errors, given by Theorem. 13.21 



5 Nonlocal realism 

In order to discuss a correlation of two independent measurements, a double 
code-transfer experiment is designed as follows: 

Definition 5.1. (Stochastic double code-transfer experiment) 

A stochastic double code-transfer experiment is defined by extending Definition 

1. Alice {A) transfers two sets of m different letters, Va — {«!, • • • ,am} o^'^d 
Vb = {/3i, • • • ,/3m}7 to Bob and Charley after encoding them. 

2. Bob (B) and Charley (C) are placed opposite to A and receive codes sent 
from A, decoding one of the two sets stochastically to a set of letters. 
Neither B and C know which set is chosen by the other {independence of 
set selection). 

3. Encoding is performed using the following probabilistic function: 

M : Va ® V"b -> M : ^i{a.,,|ij) ^ pij G [0, 1]. 

4-. A can send a finite number (n times here) of the same code to B and C. 

5. B and C select for measurement one of the elements (letters) in Va or Vb, 

independently. Possible combinations of letters are {{ak,Oii), {ak, Pi), (/3fe , a; ) , (/3fc , /3/ ) } . 
A probability that B and C may obtain one of the above pairs is 



fi{ak,ai) = ^Pk^■PlJ, fi{ak,Pi) ^^ 



Pki -Pji, 



1,3 i-J 

KPk,ai) = ^Ptk -Pij, M(/3fe,A) =^Pik -Pji- 

i,j i,3 

6. Measurements: 

(a) B measures n independent codes from A, 9y^ = {Xf , • • • ,X^}. 
Each datum is X^ = {x^^, ■ ■ ■ ,x^^}, where xf ■ E [0, 1] for k — 
1, ■ • ■ ,n and j = I,--- ,m. Among n measurements, B chooses 
Nbol times for the letter set Va and N^js times for the letter set Vb ■ 

(b) The same as (a) with B -^ C. 

7. Estimator: 

(a) B has an unbiased estimator to obtain a set of letters from measured 
data: 






Ba _ 1 
Nbc 



where 6B(fc, •) is such that 

1 (when B choose a set Va for kth measurement) 
(when B choose a set V^ for kth measurement) 

(when B choose a set Vq for kth measurement) 

1 (when B choose a set \ p for kth measurement) 



<dB(k,a) = 
,OB{k,P) = 



(b) The same as (a) with B ^ C. 

Bell's inequality [T] is a critical test to distinguish a nonlocal theory from a local 
one. This theorem can be expressed by the language of classical information 
theory [3] . We state this theorem and give a proof in the context of Definition 

Theorem 5.2. (Bell) 

Let us consider a case with a complete table to give the probability of observing 
any pair of letters as 

^\(^in(^i2T Pjn PJ2) — l^y^ii 1 Pji)l^\^i2T Pj2): 

which means B measured a pair of letters {aij,/3jj^}, and C measured a pair 
of letters {ai^, Pj^}. These measurements are performed as the stochastic dou- 
ble code-transfer experiment defined above. In this case, a conditional entropy 
follows the inequality 

Proof. On the assumption there exists a complete probability table, P{ai-^ , ai^ , Pj-^ , /^jj), 
a joint entropjo can be written as 

il,i2,jl,J2 

= S{ai^ naji,/3ji n/3-,-J. 
Using the chain rule of entropy sequentially, one can get 

On the other hand, this joined entropy satisfies 

S'(ai2 nail, ^ji n/3jj = 'S'(aj2>"»i) + 5'(/3ii,/3j2|ai2>"ii)> 

= S'(aj2) + S'(aijai2). 

Inequality follows from nonnegativity of entropy. From the property of the 
probability measure in the probability space, 

VA,VB e Vc, P{Ar\B) < P{A), 

and the definition of joint entropy, the inequalities 

follow. Then Bell's inequality is proved. D 



^ Definitions and necessary formulae for following proof can be found in [6] and summarised 
in Appendix A. 2 
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The necessary condition for Bell's inequality, the existence of the complete 
probability table P{ai^ , ai^ , /?ji , /?J2 ) j corresponds to local realism in the physical 
terminology. Here, we give an example where Bell's inequality is not maintained. 



Definition 5.3. (Stochastic double code-transfer experiment without 
a complete probability table) 

Here, the number of letters in the set is m = 2 for simplicity. 



1. The same as Definition 15.11 1. 

2. Encoding is performed using following probabilistic function: 



whe 



I . 1 2 I ■ 1 2 



ipa '■ Va —5" M ; {ai, a2} H' V'q — (cosa, sina), 
i^fi-.Vp -^ M:{/3i,/32}^V'/3 = (cos/3,sin/3). 



This algorithm configures an example of Theoru \3.2\ 
3. The same as Definition 15.113. 



4- B and C select for measurement one of the elements (letters) in Va orVp, 
independently. Before measurement, B (C) rotates a detector angle up to 
9b (0c)- Neither knows the rotating angle of the other. B and C correct 
this rotation angle after completing all measurements. This rotation does 
not affect the error of the measurement, owing to Theorem 14.11 

(a) If the angles a and (3 exist locally before the measurement, the prob- 
ability that B may obtain each letter can be obtained as 



Ms(ai) = cos^ {a - 


~0b), 


A*s(a2) = 


- sin (a - 


-Ob 


flBiM = cos^/3 - 


-Ob). 


Mh) -- 


= sin^ [13 - 


'0b 



The probability for C is similar to the above. In this case we do not 
observe any violation of Bell's inequality since we can prepare the 
complete probability table. 



Table 1: Part of the probability table for stochastic double code-transfer exper- 
iment without a complete probability table. If B and C decided to measure a 
letter from same set A, probability amplitude would have to follow this table. 







C 




ai — a 


a2 = 7r/2 — a 


B 


ai 


cos{0c-0b)cos{a-0c)/2 


-sin(6'c-6'6)sin(a-6ic)/2 


a2 


sm{0c-0b)cos{a-0c)/2 


cos{0c-0b)sm{a-0c)/2 
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(b) Suppose the angles a and /3 are not fixed before measurement and are 
fixed when BorC measure the code from Va orVp. For example, when 
C obtains ai (02), the angle a for B is fixed as a — 9c (a = 7:12 + 9^), 
i.e., the probability table is now situation- dependent. The probability 
table is listed in Table [TJ // S decided to measure a letter from a set 
Vp, nothing would happen. The probability can be calculated as 

A^s(ai) = {cos{a ~ 9b)cos{a — 9c)/2) + (cos (a — 6*;,) sin (a — 0c)/2) 
= cos^ (a - 6'6)/4, 

Mb(q;2) = (sin(a — f?;,) cos(a — 0c)/2) + (sin (a — ^f,) sin(a — 0c)/2) 
= sin^ {a - 6lb)/4. 

Needless to say, the probability to obtain one of the a can be calculated 
using only local parameters on B. On the other hand, if B decided to 
measure a letter from the same set as C, then 

Ais(ai) = {cos{9c-eb)cos{a-ec)/2-sm{ea-eb)sm{a-ea)/2f 



= cos^ (a - 9b)/4:, 
Ais(a2) = {sin {0c - 9b) cos {a- 9c) /2 + cos {0c -0b) sin {a ~ 0c) /2) 
= sin^ (a - 0b)/4:. 

Again the probability to obtain one of the a can be calculated using 
only local parameters onB. In both cases, B can obtain a set of letters 
that A intended to send. The probability table is situation- dependent 
and there is a possibility that Bell's inequality will be violated. 

5. 6. The same as the Definition 15.11 

Bell's inequality is tested numerically according to the above example. A 
correlation between measured letters independently obtained by B and C is 
defined as 

A5 = S{a^^\a^^) - {S{a^^\l3j^) + S{l3j^\lij^) + S{|3■J^\a^^)) . 

According to the results of Theorem 15.21 AS* is bounded by negative values 
when the complete probability table exists. If the theory is based on local 
realism, one can always prepare the complete table to observe letters for both 
B and C. In order to design the experiment that gives a stronger correlation 
(AS* > 0), one has to employ a rule for choosing the probability table, i.e., a 
choice that cannot be determined locally. Moreover, the rule must also satisfy 
requirements from special relativity, if one would like to interpret as physical law. 
The stochastic double code-transfer experiment defined by Definition 15. 3l is an 
example of such a rule. Under Definition l5.3b . for instance, B cannot know the 
probability table he is using because it depends on C's decision, and that cannot 
be known by B. However, if we look at only S's results, we cannot extract any 
information about C's choices and results; that means C's information cannot 
transferred to B immediately, which is a requirement from special relativity. 
This coexistence of nonlocality and special relativity is realised by the rule 
of Definition 15.3b of the stochastic double code-transfer experiment. The 
probability tables, /^s(ai) and /is (02), include 9c, though these are tables for B, 
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Figure 3: AS" for the stochastic double code-transfer experiment without a 
compfete probabihty table. It can be seen that Bell's inequality is broken in a 
part of parameter region. 



which is called " entanglement^ However, B cannot extract a value of 6c because 
9c appears only in phase of the unitary transformation and disappears after 
reaching the average. This trick can be implemented because the probability 
table represents probability amplitude. 

Violation of Bell's inequality can be judged by checking whether the corre- 
lation AS is greater than zero or not. Numerical results with employing rule of 
Definition 15.3b are calculated and shown in Fig. [3] One can clearly see the 
violation of Bell's inequality in some parameter regions. 

6 Summary 

In this report, a basic definition of quantum mechanics concerning its static 
aspect is proposed. Here, a dynamic aspect of quantum mechanics is not treated. 
The simple code-transfer experiment which satisfies the definition of quantum 
mechanics is designed to investigate some of it aspects. Then it is proved that 
a method using probability amplitude gives the minimum error for the physical 
observables using information theory. Also, it is shown that the size of the error 
doesn't depend on parametrisation of the coding. Nonlocal realism is one of 
the most essential parts of the nature of quantum mechanics. It is shown that 
quantum mechanics defined here can include nonlocal realism for the double- 
code transfer experiment introduced by extending a code-transfer experiment, 
above. We showed that the quantum mechanics defined here can violate Bell's 
inequality, thanks to the property of the probability amplitude. 

In conclusion, the probability amplitude rather than probability density gives 
the minimum and independent mean-square errors from parametrisation. More- 
over, it allows one to obtain nontrivial and nonlocal correlation on two indepen- 
dent measurements which violate Bell's inequality. 



13 



Acknowledgments 

We wish to thank to Dr. Y. Sugiyania, Profs. T. Kancko, K. Kato, T. Kon, and 
F. Yuasa for their continuous encouragement and fruitful discussions. We also 
wish to thank Prof. Tsutsui for his excellent lecture about advanced quantum 
mechanics. We would like to thank Enago for the English language review. 

References 



[9 

[lo; 

[11 
[12: 

[13; 

[14 

[15' 
[16 



J.S. BcU. Physics, 61:195-200, 1964. 

Max Born. Quantenmechanik der stofivorgange. Zeitschrift fiir Physik A 
Hadrons and Nuclei, 38:803-827, 1926. 10.1007/BF01397184. 

S. L. Braunstcin and C. M. Caves. Phys. Rev. Lett., 61:662, 1988. 

L. Brekke and P.G.O. Freund. p-adic numbers in physics. Physics Reports, 
233(1):1 - 66, 1993. 

L. N. Chang, Z. Lewis, D Minic, and T. Takeuchi. Galois field quantum 
mechanics. Modern Physics Letters B, 27(10):1350064, 2013. 

T. Cover and J. Thomas. Elements of information theory. Wiley, New 
York, 1991. 

H Cramer. Mathematical Method of Statisticas. Princeton University Press, 
1946. 

R. A. Fisher. Proc. R. Soc. Edinburgh, 42:321, 1922. 

Y. Kurihara. Classical Information-Theoretical View of Physical Measure- 
ments and Generalized Uncertainty Relations. ArXiv e-prints, December 
2012. 

Y. Nambu. Field theory of galois fields. In A. Batalin et al., editor, Quan- 
tum Field Theory and Quantum Statistics, Vol I. lOP Publishing, 1987. 

C. R. Rao. Information and accuracy obtainable in the estimation of sta- 
tistical parameters. Bull. Calcutta Math. Soc, 37:81, 1945. 

V.V.S. Vladimirov, I.V. Volovic, and E.I. Zelenov. P-adic Analysis and 
Mathematical Analysis. Series on Soviet and East European Mathematics. 
World Scientific Publishing Company, Incorporated, 1994. 

John A. Wheeler. Information, physics, quantum: The search for links. 
Physics Dept., University of Texas, 1990. 

John A. Wheeler. Sakharov revisited: It from bit. In L. V. Keldysh 
and V. Ya. Fainberg, editors. Proceedings, Sakharov memorial lectures in 
physics, vol. 2, pages 751-770, Mosccow, 1991. Nova Science Publishers. 

W. K. Wootters. Information is maximised in photon polarization measure- 
ments. In A. R. Marlow, editor. Quantum Theory and Gravitation, pages 
13-26, Bostosn, 1980. Academic Press. 

W. K. Wootters. Phy. Rev., D23:357, 1981. 



14 



A Appendix 

A.l Classical estimation theory 

We define terms associated with physical measurement according to classical 
estimation theory[5] as follows. Let ^ be a random variable for a given physical 
system described by the iV-tuple = {6*1, • • • , 6'Ar}, where 9i is the i th physical 
parameter. The set of all possible values of Oi € K, denoted by &, is called 
the parameter set. The random variable ^ is distributed according to the 
probability density function f(a;; 6) > 0, which is normalized as J „ dx f(x; 0) = 
1, where a; S R is one possible value of the whole event (= ft). For physical 
applications, we introduce the probability amplitude defined by 

\i;{x;ef=i{x;e). 

A part of experimental apparatus is assumed to output numbers distributed 
according to the probability density. Any resulting set of numbers ^n = 
{xi, • • • ,Xn}, drawn independently and identically distributed (i.i.d.), is called 
the experim,ental data. The estimate of the physical parameter is called a mea- 
surement. Because experimental data are i.i.d., the corresponding probability 
density function can be expressed as a product: 

n 

f(^„;0) = [] f(a;,;0). 
i=i 

A function mapping the experimental data to one possible value of the parameter 
set such as 

Ti : jr„ -^ : {xi, • • • , Xn] ^ Oi 

is called an estimator for the ith physical parameter, denoted by Ti{J%'n) = 9i. 
The experimental error in the ith physical parameter is defined as the root mean 
square error: 

e, =i?[(T,(S;)-0,)Y/', 

where 9i is the true value of the i th physical parameter. True values of physi- 
cal parameters are typically unknown, but a mean-square error can be reduced 
below any desired value by accumulating a sufficiently large amount of exper- 
imental data, thanks to the law of large numbers. If the mean value of the 
experimental error is zero, i.e., Eg.[9i — 9i\ = 0, after accumulation of infinitely 
many statistics, that estimator is called an unbiased estimator. Among such 
estimators, the one giving the least error is called the best estimator. 

A. 2 Information theory 

For a probability space (51,^, P) and probability variable X defined on it, in- 
formation entropy S{X) is defined as 



5(X) = -^P(x)logP(x). 



xeo 
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S{X) > immediately follows from < P < 1. For two probability variable 
X,Y whose domains are flx,^y, where n^T^y C 17, a joint entropy is defined 
as 

S{X,Y)^-Y, ^ P(a;ny)logP(xny), 

Xf^itj: y€^y 

where P{x n y) is a probability to observe a; in X and y iny , simultaneously. A 
conditional entropy is defined as 

S{Y\X) = - ^ ^ P(x n y) logP(y|x), 

Where P{y\x) is conditional probability to observe y in Y when x inX is ob- 
tained. On those entropies, following formulae are obtained: 

SiX,Y) = SiX) + SiY\X), 
S{X\Y) < S{X). 
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